Temporal difference models describe higher-order learning in humans
نویسندگان
چکیده
منابع مشابه
Evaluation of Models to Describe Temporal Growth in Local Chickens of Ghana
The logistic, Gompertz, Richards and asymmetric logistic growth curve models were fitted to body weight data of local Ghanaian chickens and French SASSO T44 chickens. All four growth models provided good fit for each sex by genotype growth data with R2 values ranging from 86.7% to 96.7%. The rate constant parameter, k, ranged between 0.137 and 0.271 and were significantly different from zero fo...
متن کاملThe application of temporal difference learning in optimal diet models.
An experience-based aversive learning model of foraging behaviour in uncertain environments is presented. We use Q-learning as a model-free implementation of Temporal difference learning motivated by growing evidence for neural correlates in natural reinforcement settings. The predator has the choice of including an aposematic prey in its diet or to forage on alternative food sources. We show h...
متن کاملDifference Equations in Massive Higher Order Calculations
The calculation of massive 2–loop operator matrix elements, required for the higher order Wilson coefficients for heavy flavor production in deeply inelastic scattering, leads to new types of multiple infinite sums over harmonic sums and related functions, which depend on the Mellin parameter N. We report on the solution of these sums through higher order difference equations using the summatio...
متن کاملDual Temporal Difference Learning
Recently, researchers have investigated novel dual representations as a basis for dynamic programming and reinforcement learning algorithms. Although the convergence properties of classical dynamic programming algorithms have been established for dual representations, temporal difference learning algorithms have not yet been analyzed. In this paper, we study the convergence properties of tempor...
متن کاملPreconditioned Temporal Difference Learning
LSTD is numerically instable for some ergodic Markov chains with preferred visits among some states over the remaining ones. Because the matrix that LSTD accumulates has large condition numbers. In this paper, we propose a variant of temporal difference learning with high data efficiency. A class of preconditioned temporal difference learning algorithms are also proposed to speed up the new met...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nature
سال: 2004
ISSN: 0028-0836,1476-4687
DOI: 10.1038/nature02581